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Analysis by SDS-PAGE and GPC-MS of fermented cocoa extracts shows changes in the amount 
and composition of the major proteins, accompanied by formation of complex distributions of peptides. 
MS/MS studies and application of SEQUEST sequencing software have allowed identification of 
two related peptides, a hexapeptidc and a nonapeptide. formed from vicilin. one of the cocoa storage 
pron ins. l ime course studies of the two peptides show different, abundance profiles and indicate, 
iu part, production of the hexapeptide from the nonapeptide. 
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IN I K* sl>l x i h »\" 

The iniMju^ H.ivnr uf chocolate is generated during the 
procrssii!" «•! riKCM beans, the seeds of the tree Then- 
hrnm.i i.fl'.m. which involves fermentaLion. drying, and 
roast inn (/. .'3- Fermentation is a crucial step in 
processing, and iiiif«rmt?nU;d beans do not produce cocoa 
flavor upon roasting (3, '/)• Roasting induces high- 
temperature chemical reactions, believed to be mainly 
Maillard type condensations between reducing sugars 
and the free amino groups of peptides and amino acids, 
during which flavor compounds are produced (5, 6). 

I -"em lent ai ion involves microbiological and enzymatic 
reactions thai lead to extensive breakdown of the cocoa 
seed proteins. The peptides so formed are considered 
likely to he important flavor precursors (7-0. Although 
(he role of peptides in the development of cocoa flavor 
formation has been documented through laboratory 
experiments involving pept idc containing fractions (.9- 
//), the structures of peptides formed during fermenta- 
tion have not been investigated rigorously. 

The advent of atmospheric pressure ionization mass 
spectrometry lias greatly facilitated the determination 
of peptide sequences in protein digest mixtures following 
chromatographic separation (12-15). The main ap- 
proach involves electrospray-mass .spectrometry (ES- 
MS) and tandem mass spectrometry (MS/MS), combined 
with computer prediction of primary structure through 
comparison with databases of known protein sequences 
(16-19). 

Here, we describe the changes in proteins, peptides, 
and amino acids during fermentation of cocoa beans and 
the identification of two peptides by gel permeation 
chromatography (GPC)-MS/MS. 
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FXPKR1MKNTAL PROCEDURES 

Samples. Amclunadn cocoa beans from the Ivory Coast were 
obtained for each day of a seven-day fermentation period. The 
I wans wen: hand peeled, milled (j* s) with cooling (IK A M20 
laboratory mill, Janke & Kunkel. IKA Work Siaufcn. Ger- 
many), and sieved (8 mm). Ground cocoa was stored in the 
dark in airtight containers at 4 G until use. 

Extraction of Proteinaccous Matter from Cocoa. 
Ground cocoa (10 g) was placed in a cellulose extraction 
thimble (30 mm x 100 mm: Whatman. Kent. U.K.) and 
defatted by Soxhlet extraction with //-hexane (24 h). The 
residues were air-dried and stored in the dark in airtight 
containers at 4 'C. 

Defatted cocoa (1 i\) was ex traded with ethanoic acid/water 
1:1 (10 inL) by sonication (f.ahsonic U, Braun/Inotech, Uot- 
tikori. Switzerland: 1 miii). homogenized (Polytron. Kinematica 
AG. Aiglc. Switzerland; I min), and centrifuged (Beckman 
Instruments Inc. Pnllerton. CA; 27000/,' at 4 'C; 10 min). The 
supernatant was filtered (0.45 /mi: I J ail Gclman. Hampshire. 
U.K.) and stored at -20 C. 

SOS-PAGE Analysis of Proteins ami Peptides. Ground 
samples of cocoa (100 m«») were mixed with extraction buffer 
(1 mL. 100 rnM ti is-el hanoale buffer, pi I 8.1. containing 1% 
sodium dodecyl sulfate (SOS) artd 1% dithiothreitol). Mixtures 
wen: healed (f)5 V. for 30 min) to denature the proteins. 
Samples were allowed to cool and subsequently centrifuged 
(18000# for 5 min). Supernatants were filtered (0.2 /mi) and 
prepared for SUS-PACK gels by addition of 4 volume equiv of 
sample buffer (Bio- R ad. glycine system buffer), containing 
0.005% (w/v) hromophenoi blue as tracking dye. SDS-PAGE 
gels were prepared and run according to the method of 
Lacmmli [2(h- 

Preparation of Standards and Extracts for GPC and 
GPC-MS. All standards were purchased from Sigma (Sigma- 
Aldrtch Ltd.. Poole, U.K.). The L-amino acid standard mixture 
(comprising 2.5/miol mL" 1 of alanine, arginine, aspartic acid, 
glutamic acid, glycine, histidine. isoleucine. leucine, lysine, 
met hionine, phenylalanine, proline, serine, threonine, tyrosine, 
and valine and 1.25 zmiol mL" 1 of cystine) was diluted 10 times 
into mobile phase. Individual peptide standards in mobile 
phase (diglycine. 1 mg mL" 1 ; triglycine, 0.8 mg mL'" 1 : tetra- 
glycine. 1 mg mL" 1 : and angiotensin II, 0.1 mg mL" 1 ) were 
injected (20 mL) onto the GPC column. Cocoa extracts were 
thawed and filtered prior to injection (20 //L). 

© 2001 American Chemical Society 
Web 11/21/2001 
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Figure 1. SDS-PAGH of cocoa extracts at different stains of 
fermentation. 

GPC Separation of Amino Acids, Peptides, and Pro- 
teins. A Waters (Mil ford. VIA) I .C system comprising a 600MS 
controller, a 717 autosainpler. anil a 996 phorodiode array 
detector was used for GPC. Instrument control, data process 
ing. and analysis were performed using Waters Millenium 
2010 software. All solvents wore sparged using helium. CI PC 
was performed using a stainless steel Aquagel-OH 30 column 
(;<0U >: 7.5 mm: Polymer Laboratories. Shropshire. U.K.) 
packed with rigid rnacroporous hydropliilic srvrcne polymer 
(panicle size - 8 «in. pore size 30 A). Isocralic: eluiioii was 
performed at a How rate of 0.5 ml. min 1 for 40-60 min usiiu» 
f>0 niM ethanoic acid adjusted to pi I 3.6 with ammonia as 
mobile pha.se. The detector was set to acquire WV-vis aUsor- 
bancc data between 200 and 400 nm. 

CPC-MS. GPC-MS was performed on an ion trap mass 
spectrometer (Finriigan MAT F.CQ. San Jose. CA) coupled with 
a Thermo Separation Products AS3000 auiosampler. an ScM 
membrane degasser. a P4000 gradient pump, and a Thermo- 
Quesi UV2000 UV -vis (215 rim). The MS was operated in the 
elect rospray (ES) ionization, positive ion mode using Hnni'-nn 
Navigator software version 1 .2 and 1 .C condit ions as desct ibed 
above*. KS-V1S instrument settings were as follows: sheath gas. 
80 (arbitrary units); auxiliary gas. 20 (arbitrarv units); spray 
voltage. 1.2 kV; capillary temperature. 275" TJ; capillary 
voltage. 33 V: tube huts offset voltage. 50 V; collision energy. 
15%. 

l ull-scan MS spectra were acquired over the mass ranee 
50-2000 Da. MS/MS spectra were acquired using data- 
dependent scanning with an isolation width of 5 m'z. KG- MS/ 
VIS data were analyzed using SkOUKST software (tiioExplore 
version 3.1. Finnigan. San Jose. CA) lo search against human, 
cocoa vicilin. and cocoa albumin sequence database .subsets 
created from the OWL database (version 290797) (A'/). 

KKSU.TS AM) DISCUSSION 

lixi.racis of unformonted cocoa beans gave four major 
bands on SDS PACK (46. 32. 22. and 1 4 kDa; Kigurci 1) 
as observed previously (9, 22). During formonraiion the 
4b* and 32 kDa proteins, previously identified as vicilins. 
were hydrolyzed rapidly and were essentially absent 
after 2 days. The albumin (22 kDa) and the unidentified 
14 kDa proteins were nor degraded to the same extent, 
and some traces persisted at day 7 (cf. refs 23 and 24). 
The absence of any increase in polypeptides between 46 
and 14 kDa over the fermentation period suggests that 
the proteins break down mainly to components with 
masses < 14 kDa. 

Standards. The suitability of the Aquagel-OH col- 
umn for separation of amino acids, peptides, and 
proteins was evaluated using amino acid and peptide 
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Figure 2 Partial GPC chromatogram at 215 nm of standard 
amino acids on an Aquagel-OH column. 
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standards covering a range of molecular masses. Aqua 
gel-Ol 1 is compatible only with aqueous mobile phase 
compositions containing up ro a max i mum of 50% 
methanol. Initial studies by GPC-MS revealed poor peak 
shape's with methanol present. The host peak shapes 
were obtained using a purely aqueous mobile phase 
buffered with etharioic acid at. pi I 3.6 (see Experimental 
Procedures). 

Given the effective nominal exclusion range of the 
column (100-30000 Da), amino acids would nor he 
expected to exhibit a high degree of separation through 
a size exclusion mechanism. The standard mixture of 
16 amino acids and the cli peptide cystine was chroma to- 
graphed to examine the effect on retention behavior of 
the various amino acid side chains. The basic amino 
acids (R. K. and H). the side chains of which will be 
protonat.ed at pi I 3.6. were the earliest to eluie (Figure 
2). Amino acids with alky I side chains (V. !\ f. and I.) 
or continuing sulfur (cystine and M) eluted next, fol 
lowed by the amino acids containing a carboxyl (D and 
F) or hydroxy I functionality (T). The latest eluting 
amino acids were those possessing an aromatic side 
chain (Y and F). Although they were present in the 
standard mixture at the same concentration as the other 
amino acids, MH T ions for A. G. and S were not 
observed. 

Oligomers of glycine (GG. GGG. and GGGG) were all 
ret ained to a similar degree (o< ca. 17 min) but with GG 
eluting slightly later than GGG and GGGG (Figure 3a- 
c), revealing low selectivity for GPG in this mass range. 
Clearly, therefore, for low-mass species, such as the 
amino acids and glycine oligomers, the column exhibits 
its most significant activity in the partition chromatog- 
raphy mode. The ability to form tt-ti interactions 
between polystyrene -di vinyl benzene and aromatic 
moieties is believed to be responsible for the particularly 
long retention of the aromatic amino acids. 

The octapeptide angiotensin II (DRVYIHPF) eluted 
later {t R — 18 rnin) than would be expected on the basis 



of molecular si/.e alone (Figure 3d). The strong r etention 
of angiotensin is influenced by the compet ing effects of 
the protonat.ed hydro phi lie amino acids K and 11 and 
the aromatic amino acids Y and F. Thus, on this GFC 
stationary phase peptides, as well as amino acids, are 
separated not only by their si^c but also by hydrophobic- 
type interactions, particularly those including aromatic 
residues. 

Peptide sequence information can bo obtained by \iS~ 
MS using collision-induced fragmentation of the proro- 
nared molecules and matching of the MS/MS product 
ion distributions (16, 17) to b ions (charge-retaining 
fragments from the amino terminus) and y-ions (charge- 
retaining fragments from the car boxy terminus) pre 
dieted from known protein sequences (IS. Iff). 

The isolation and fragmentation of a standard of 
angiotensin II (Mil" rnlz 1040) gave a product ion 
spectrum (MS/MS) arising mainly from cleavage of 
peptide bonds. Thus, b- and y-ions arising from sequen- 
tial loss of amino acids from the peptide were present. 
Analysis using SKQUKST against a human specific 
sequence subset extracted front the OWL database 
accurately predicted the amino acid sequence (Figure 
4). The raw correlation score (Xcorr) was 2.03. and the 
preliminary raw score (Sp) was 309. hot Ft of which 
correspond to good statistical matches and signify a high 
probability of a correctly identified peptide sequence. 
The difference (DelCn) between the scores for the top- 
ranked and the second -ranked peptides was above 0.1. 
indicating a high probability of a correct match (16. 25). 

Cocoa Extracts. Separ ation of cocoa components by 
GPC (Figure 5) revealed a complex distribution of 
components with peaks in the same elution window for 
the amino acid and peptide standards (fa values be- 
tween 1 2 and 25 min). On-line UV-vis and MS spectra 
allowed identification of the peaks with high intensity 
in the UV chromatogram as methylxanthines (theobro- 
mine and caffeine, Cr = 27 and 45 rnin, respectively). 
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C li««'i"..iMn Uims for rhf , difrerci|l days (jf rermenraiion 
rev,.;,! .!..„,..«.* in the relative abundances of compo- 
nents , ?l M-inn where peptides ehite (Heurc 5a) 
su-nesMM;: !h;„ hyrfi olysis or polypeptides results in the 
gcniT.UHKioi iiR-1-oa.sinRqu.-inriticss of small peptides and 
amino .it ids over the fermentation period. 

I he h , M . ; ,k mass rhmmaiogram of the fermented 
extra* t hn„, day ;, (Fi !4ul -<, 5b) shows species that form 
ions m «•:„(<■ mainly between 12 and 25 min. that is in 
the n-mnthM, time window where the standard amino 
acids ...hI peptides were found to elute. and reveals 
«reaif;r complexly than is apparent from the UV 
rhromarni;!..,,,. 1 wo of U„. most prominent ions were 
at nusUA and 902. Both ions show maximum intensity 
•it A: - /U nun. significantly later than the octapentide 
»"«'««t-i:sin H (Fio un . 3d). su S gestin» them to he 
hydrophobic m character and likely to contain aromat ic 
resHlues. i roducr ion spectra generated from m/rC>?) 
and JO*. WI.TL- analyzed using SI- QUEST to extract 
matches with sequences of cocoa viciliri anrl albumin 
proteins. Database subsets for the cocoa proteins wen* 
created to facilitate the search. J lie search results wcie 
examined borh for close matches between the experi- 
mental and predicted monoisotopic molecular masses 
and lor matches between predicted b- and y-ions with 
those in the MS/MS spocrra. " 

The product ion spectrum (Figure GaJ of /??/* UQV „ ave 
a match to a nonapeptidc sequence (AFLSPGDVF) in 
cocoa vicilin protein. SYQUEST search results (fermcn- 
tation clay 6) gave Xcorr = 1.82, Sp = 413. and DelCn 
- - 0-77. The product ion spectrum of miz 621 p avc a 
match to a hexapepride (SPGDVF) correspond inn to a 
sequence within the nonapeptidc (Figure 6b). The best 
match statistics, obtained for the sample at fermenta- 
tion day 6. were Xcorr = 1.31. Sp = 258. and DelCn - 
U.75. n both cases the match statistics indicate a high 
probability for the match, with similar or better confi- 
dence levels than for angiotensin II. Furthermore both 
peptide sequences contain only amino acids with alkyl 
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identifications The two peptides have been re none 

r, ™ ■ ( ) " In U,at s,U(, y' «»«»fatted West African 
cocoa beans were inrubaiccl in vitro with incl ™ I 
«x»a enzymes in acidic buffer and the noriiSSKs 
isolated and characterized by Kdman degSa on 

SPGDVF and SPGDVn" ° f "V™ prides (API..- 
Z^l V ^PGDVb) was evaluated from plots of 

nTA?'^. TonV^ CPC - MS mass chromato^ o 
tSnn .> 902 (ri8Ure 7) over tne seven-day ferineo 
tatior . J he two peptides have very similar max mum 
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fermentation / day 

Fi"ure 7 Abundance profiles for iwo peptides formed during 
cotton fermentation: (- - -) ml?. 902.4: (-) inly. 62 1.3 (error bars 
represent standard deviation on rhroo replicates). 

complex rhan would be expected by cither simple 
parallel or sequential pathways for formation and decay 
of i lu* iwu peptides. Given the homology in the amino 
iuid sequence of the two peptides, their time course 
siuth- suggests that lhe hexapepride is Conned from the 
nonnpepVidc as the fermentation progresses. 

Conclusion. The hyphenated technique of (.PC MS 
!ias been shown to he extremely povveiTul lor both 
NtMjui-nt ing and determining changes in abundance 
love is of peptides in complex mixtures. This paper 
confirms the identification of two peptide products (a 
hoxapeptide and a nonapept ide) and demonst rates, for 
the first lime, their formation from cocoa vicilin protein 
dm inp, the fermentation step used in chocolate manu- 
facture. Cocoa proteins undergo extensive hydrolysis 
during fermentation with consequent increases in pep- 
tide abundances. Peptide abundance profiles as a func- 
tion of lime indicate thai the he.xapepiide (which 
exhibits sequence homology with the nonapept -ide) is 
formed, in part, from the nnnapepr ide as the fermenta- 
tion progresses. Future work will be designed to ident ify 
ihe structures of other peptides, follow their time course 
during fermentation, and clarify in det ail ihe nature of 
react ions involved in fermentation. 
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